HUKB at NTCIR-12 IMine-2 Task: Utilization of Query Analysis Results and Wikipedia Data for Subtopic Mining
Abstract
Query understanding is a task to identify the important subtopics of a given query along with their vertical intents. In this task, characteristic keywords extracted from query analysis results and Wikipedia are used as subtopic candidates. From these candidates, appropriate subtopics are selected using a topic model built from the web documents retrieved by the original query. Vertical intent is judged mainly by a list of keywords typical of each vertical intent. For the Image, News, and Shopping verticals, the system also checks the type of the retrieved documents, which is estimated from the ALT values of IMG tags, anchor text, and a site list for URLs.
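As a rough illustration of the keyword-list judgment described in the abstract, the following minimal Python sketch assigns a vertical to a subtopic by counting matches against per-vertical keyword lists. The keyword lists, the function name `guess_vertical`, and the fallback vertical "Web" are all assumptions made for this example; the abstract does not give the actual lists or the scoring rule used by the system.

```python
# Hypothetical keyword lists for illustration only; the abstract does not
# state which keywords the system actually uses.
VERTICAL_KEYWORDS = {
    "Image": {"photo", "picture", "wallpaper"},
    "News": {"news", "latest", "headline"},
    "Shopping": {"buy", "price", "cheap"},
}


def guess_vertical(subtopic, default="Web"):
    """Return the vertical whose keyword list matches the most tokens.

    Falls back to `default` (an assumed catch-all vertical) when no
    keyword matches at all.
    """
    tokens = set(subtopic.lower().split())
    best, best_hits = default, 0
    for vertical, keywords in VERTICAL_KEYWORDS.items():
        hits = len(tokens & keywords)
        if hits > best_hits:
            best, best_hits = vertical, hits
    return best


if __name__ == "__main__":
    for s in ["buy iphone case", "latest apple news", "iphone history"]:
        print(s, "->", guess_vertical(s))
```

A real system would presumably combine such a keyword signal with the document-type checks mentioned above (ALT values, anchor text, site lists); this sketch covers only the keyword-list step.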
Similar resources
Université de Montréal at the NTCIR-11 IMine Task
In this paper, we describe our participation in the NTCIR-11 IMine task, for both the subtopic mining and document ranking subtasks. We experimented with a new approach for aspect embedding which learns query aspects by selecting (good) expansion terms from a set of resources. In our participation, we used five representative resources: ConceptNet, Wikipedia, query logs, feedback documents and query su...
SEM13 at the NTCIR-11 IMINE Task: Subtopic Mining and Document Ranking Subtasks
In this paper, we describe our participation in the English Subtopic Mining and Document Ranking subtasks of the NTCIR-11 IMINE Task. In the Subtopic Mining subtask, we mine subtopics from query suggestions, query dimensions, and Freebase entities of a given query, rank them based on their importance for the given query, and finally construct a two-level hierarchy of subtopics. In the Document ...
YJST at the NTCIR-12 IMine-2 Task
The Yahoo Japan Search Technology (YJST) team participated in the Query Understanding subtask of NTCIR-12 IMine-2. We explored various search log mining techniques to discover subtopics for the given original topics. For Vertical Identification, we trained a Gradient Boosted Decision Tree (GBDT) learner to assign a vertical label to each subtopic using several complex features including topic...
IMC at the NTCIR-12 IMine-2 Query Understanding Subtask
This paper describes the participation of the IMC team in the Chinese Query Understanding Subtask of the NTCIR-12 IMine-2 Task. To identify the subtopics of a given query, we utilize several data resources and employ a new-word extraction theory to obtain the expansion terms for a query, which is the kernel of the proposed system. Then we generate the query subtopic based on the expansio...
TUTA1 at the NTCIR-11 IMine Task
In this paper, we detail our participation in two subtasks: subtopic mining and document ranking of the NTCIR-11 IMine task. In the subtopic mining subtask, to discover the latent hierarchy among query-like strings, our key idea is to structurally parse query-like strings by characterizing pairwise dependency in the bag-of-units perspective. Then the clustering algorithm (i.e., affinity propaga...
Publication date: 2016